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Abstract 

In this paper, diversity analysis of bit-interleaved coded multiple beamforming (BICMB) is extended to the 
case of general spatial interleavers, removing a condition on their previously known design criteria and quantifying 
the resulting diversity order. The diversity order is determined by a parameter Qmax which is inherited from the 
convolutional code and the spatial de-multiplexer used in BICMB. We introduce a method to find this parameter 
by employing a transfer function approach as in finding the weight spectrum of a convolutional code. By using this 
method, several Qmax values are shown and verified to be identical with the results from a computer search. The 
diversity analysis and the method to find the parameter are supported by simulation results. By using the Singleton 
bound, we also show that Qmax is lower bounded by the product of the number of streams and the code rate of 
an encoder The design rule of the spatial de-multiplexer for a given convolutional code is proposed to meet the 
condition on the maximum achievable diversity order 

I. INTRODUCTION 

When the channel information is perfectly available at the transmitter, beamforming is an attractive tech- 
nique to enhance the performance of a multi-input multi-output (MIMO) system |[T1. A set of beamforming 
vectors is obtained by singular value decomposition (SVD) which is optimal in terms of minimizing the 
average bit error rate (BER) [|2]|. Single beamforming, which carries only one symbol at a time, was shown 
to achieve full diversity order of NM where is the number of transmit antennas and M is the number 
of receive antennas (H, JU. However, multiple beamforming, which increases the throughput by sending 
multiple symbols at a time, loses the full diversity order over flat fading channels. 
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To achieve the full diversity order as well as the full spatial multiplexing order, bit-interleaved coded 
multiple beamforming, combining bit-interleaved coded modulation (BICM) and multiple beamforming, 
was introduced in [5J. Design criteria for interleaving the coded sequence were provided such that each 
subchannel created by SVD is utilized at least once with a corresponding channel bit equal to 1 in an error 
event on the trellis diagram BICMB with 1/2-rate convolutional encoder, a simple interleaver 

and soft-input Viterbi decoder was shown to have full diversity order when it is used in a 2 x 2 system 
with 2 streams. In this paper, the diversity order is analyzed even when the interleaver does not meet 
the criteria of flSl, flU. To determine the diversity order, the error events that dominate BER performance 
need to be found. We introduce a method to find the dominant error events by extending a method from 
convolutional code analysis to determine system performance, e.g., [|71, (HI, into the analysis of the given 
combination of the interleaver and the code. We also show that for any convolutional code and any spatial 
de-multiplexer, the maximum achievable diversity order is related with the product of the code rate and 
the number of streams, by using the Singleton bound [QJ. The design rule of the spatial de-multiplexer to 
get the maximum achievable diversity order is also proposed. 

The rest of this paper is organized as follows. A brief review of the BICMB system is given in Section 
ini Section UlI] introduces a method to find a-vectors for a given convolutional code and the number of 
subchannels. Pairwise error probability (PEP) analysis is given in Section |IVl In Section |Vl the analysis 
of the maximum achievable diversity order of BICMB is shown, and the design rule of the spatial de- 
multiplexer for the maximum achievable diversity order is proposed. Simulation results supporting the 
analysis are shown in Section |VIl Finally, we end the paper with a conclusion in |VII[ 

II. BICMB Overview 

The code rate Rc = kc/uc convolutional encoder, possibly combined with a perforation matrix for a 
high rate punctured code, generates the codeword c from the information vector b. Then, the spatial 
de-multiplexer distributes the coded bits into S sequences, each of which is interleaved by an independent 
bit-wise interleaver. The interleaved sequences D are mapped by Gray encoding onto the symbol sequences 
Y. A symbol belongs to a signal set % C C of size \x\ = 2™, such as 2"*-QAM, where m is the number 
of input bits to the Gray encoder. 

The MIMO channel H G C*^^^ is assumed to be quasi-static, Rayleigh, and flat fading, and perfectly 
known to both the transmitter and the receiver. In this channel model, we assume that the channel 
coefficients remain constant for the L symbol duration. The beamforming vectors are determined by 
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the singular value decomposition of the MIMO channel, i.e., H = UAV where U and V are unitary 
matrices, and A is a diagonal matrix whose s*^ diagonal element, G M, is a singular value of H with 
decreasing order. When S symbols are transmitted at the same time, then the first S vectors of U and 
V are chosen to be used as beamforming matrices at the receiver and the transmitter, respectively. Let's 
denote the first S vectors of U and V as U and V. The system input-output relation at the k^'^ time 
instant for a packet duration is written as 



where is an S* x 1 vector of transmitted symbols, r^. is an S x 1 vector of the detected symbols, and 
rifc is an additive white Gaussian noise vector with zero mean and variance Nq = N/SNR. On each s*'^ 
subchannel, finally, we get 



where r^^s, yk,s, and hk^s are a detected symbol, a transmitted symbol, and a noise term, respectively. H 
is complex Gaussian with zero mean and unit variance, and to make the received signal-to-noise ratio 
SNR, the total transmitted power is scaled as N. The equivalent system model is shown in Fig. [U 

The location of the Z*'^ coded bit ci within the detected symbols is stored in a table / [k, s, i), where 
k, s, and i are time instant, subchannel, and bit position on a symbol, respectively. Let xl X where 
6 G {0, 1} in the i*^ bit position. By using the information in the table and the input-output relation in 
the receiver calculates the ML bit metrics as 



The combination of the ML bit metrics of ([3]) and U detector at the receiver is not the unique solution 
to get the optimum BER performance. Appropriate bit metrics corresponding to a linear detector, such as 
zero-forcing (ZF) or minimum mean square error (MMSE) detector, were shown to be equivalent to the 
bit metrics of ([3]) with U detector [[TOl . Finally, the ML decoder can make decisions according to the rule 



rfc = U^HVyfc + U^n^ 



(1) 



rk,s = >^syk,s + nk,s 



(2) 



Y{rk,s, ci) = min \rk,s - Kv 



(3) 




(4) 



III. a-SPECTRA 

The BER of a BICMB system is upper bounded by all the summations of each pairwise error probability 
for all the error events on the trellis Q, |l6l. Therefore, the calculation of PEP for each error event is 
needed to analyze the diversity order of a given BICMB system. If the interleaver is properly designed 
such that the consecutive long coded bits are mapped onto distinct symbols, the PEP between the two 
codewords c and c with Hamming distance is upper bounded as [[51 



P{c ^ c) = E[P{c ^ c|H)] 
1 



< E 



exp 



s=l 



4iVo 

V /J 



(5) 



where dmin is the minimum Euclidean distance in the constellation and denotes the number of times 
the s*^ subchannel is used corresponding to dn bits under consideration, satisfying Yl^^i as = du- Since 
PEP is affected by the summation of the products between and singular values as can be seen in ([5]), 
it is important to calculate the ^-vectors for each error path to have an insight into the diversity order 
behavior of a particular BICMB implementation. 

It has been shown in [[51, [|6|[ that for a single-carrier BICMB system, if the interleaver is designed such 
that, for all error paths of interest with Hamming distance du to the all-zeros path, 

1) the consecutive coded bits are mapped over different symbols, 

2) a, > 1 for 1 < s < 5, 

then the BICMB system achieves full diversity. In this paper, we will analyze cases where the sufficient 
condition «s > 1 may not be satisfied, i.e., as = for some s = 1, 2, ■ ■ ■ , 5 is possible. In order to carry 
out this analysis, as well as to get an insight into the system behavior in [[51, [[6l, one needs a method to 
calculate the values of Us (which we call the a-vector) of an error path at Hamming distance du to the 
all-zeros path. 

We developed a method to calculate the a-vectors for a convolutional code and interleaver combination. 
We will now illustrate this method with a simple example. For this example, the system is composed of a 
4-state 1 / 2-rate convolutional encoder and a spatial de-multiplexer rotating with an order of a, h, c, and d 
which represent the four streams of transmission. Fig. [2] represents a trellis diagram of this convolutional 
encoder for one period at the steady state. Since a convolutional code is linear, the all-zero codeword is 



assumed to be the input to the encoder. To find a transfer function of a convolutional code and a spatial 
de-multiplexer, we label the branches as a combination of a"^", b'^'', c^", and d'^'^, where the exponent (pi 
denotes the number of usage of the subchannel i which contributes to detecting the wrong branch by 
the detector. Additionally, Z^^ , whose exponent satisfies 0z = 0a + 06 + 0c + (pd, is included to get the 
relationship between the Hamming distance dn and a-vector of an error event. Furthermore, the non-zero 
states are arbitrarily labeled Xu through X23, while the zero state is labeled as Xi if branches split and 
Xo if branches merge as shown in Fig. [2l 



Let's denote x 
equation 
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. Then, the state equations are given by the matrix 



X = Fx + tXi 
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We also get 



abZ'^ cdZ^ 



X. 



The transfer function is represented in closed form by using the method in [8] as 

00 

T(a, b, c,d,Z) = g[I-F]-'t = gt + Y, gF't 

k=l 

= Z\a%''d + bcH^) 

+ Z%2a%c'd + a%''d^ + b^c^d^) 

+ Z\a^b'^ + 2a%''^d + 2a%^d''+ 

b'c'd'' + a^b^d"" + a^c^rf^) 
+ Z\a%\^ + Aa%\^d + Aa%\^d^+ 

b^C^d^ + a^C^rf^ ^ 4^2^^2^3 ^ ^2^2^4^ ^ . 



(6) 



(7) 



(8) 



where [I — F] ^ can be expanded as I + F + F^ H through an infinite series of power of matrices. The 
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weight spectrum, used for error performance analysis of convolutional codes, can be easily determined 
by T(a, b, c, d, Z) \a=h=c=d=i and can be compared with the literature IfTTTl . lfT2ll . 

Assume that a, 6, c, d are assigned to be the stream numbers 1, 2, 3, and 4, respectively. We can then 
figure out from the transfer function that the a-vectors of two error events with Hamming distance equal 
to 5 are [2 2 01] and [0122]. Besides, the vectors with ai equal to are easily found by choosing the 
terms composed of only h, c, and d, which are [0122], [0222], [0322], and [0 4 2 2]. No vector is found 
which has ai = 0:2 = or ai = 0^2 = as = 0. 

This method can be applied to any /C-state kd ric-rate convolutional code and S'-stream BICMB system. 
If the spatial de-multiplexer is not a random switch for the whole packet, the period of the spatial de- 
multiplexer is an integer multiple of the least common multiple (LCM) of ric and S. Note that we restrict 
a period of the interleaver to correspond to an integer multiple of trellis sections. Let's denote P = 
LCM{nc, S) which means the number of coded bits for a minimum period. Then, the dimension of the 
vector X is nP{IC — l)kc/nc where n is the integer multiple for a period of interest. 

By using this method, transfer functions of a 4-state 1/2-rate convolutional code with generator polyno- 
mials (5, 7) in octal combined with several different de-multiplexers are shown in (flOl) . and (fTTl) . The 
spatial de-multiplexer used in Ti and T2 is a simple rotating switch on 2 and 3 subchannels, respectively. 
For T3, i*'^ coded bit is de-multiplexed into subchannel Smod{i,i8)+i where si = ■ ■ ■ = sq = 1, s-/ = ■ ■ ■ 
= S12 = 2, Si3 = ■ ■ ■ = S18 = 3 and mod is the modulo operation. Throughout the transfer functions, the 
variables a, b, and c represent P*, 2"^^, and 3^'^ subchannel, respectively, in a decreasing order of singular 
values from the channel matrix. 

+ Z^(3aV + aV) 

+ Z«(aV + 6aV + aV) (9) 

+ Z\5a%^ + 10a%' + a%') 

+ Z'\a%^ + 15a%^ + lha%'^ + a^fe^) + ■ ■ • 



+ Z\a%'^c + a%'^c + a%c^+ 
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ab^c^ + a%c^ + ah'^e') 
+ Z\2a^h^c + 2a%^c^ + 2a^h^c^+ 

2a^c^ + 2a%'^c^ + 2ah^c^) (10) 
+ Z«(a^6=^ + a%\ + a^fo^c + 2a%\^+ 

Sa'b'c' + 2a%^c^ + a%c' + 3a''b'c^+ 

3a%^c^ + ah'^c^ + iPc^ + a%c^+ 

2aW + a6V + aV) + --- 

iP + a^c^ + h^c^ + a^c^ + h'^e' + c^) 

+ Z6(a^62 ^ 3^3^3 ^ ^2^4 ^ ^4^2 ^ 3^2^2^2^ 

fo^c^ + Sa^c^ + 36=^c3 + a^c^ + h^") (11) 

7a%'c^ + 2a^c' + a%c' + 7a'bh'+ 
ab^c^ + 2b^c^ + 2a^c^ + 2b^c^) + ■■■ 

Ti shows no term that lacks any of variables a and b, which means the interleaver satisfies the full 
diversity order criterion, > 1 for s = 1, 2 [|51, Most of the terms in T2 are comprised of three 
variables, a, b, and c. However, three error events with Hamming distance of 8 lack one variable, resulting 
in the a-vectors as [5 3 0], [0 5 3], and [3 5]. In T3, many terms missing one or two variables are observed. 
Especially, vectors with ag = for two subchannels can be found as [5 0], [0 5 0], and [0 5]. In Section 
HVl we present how these vectors affect the diversity order of BICMB. 

IV. Diversity Analysis 

Through the transfer functions in Section Hill we have seen interleavers which do not guarantee the full 
diversity criteria. As stated previously, contrary to the assumption in O that as > 1 for s = 1, 2, ■ ■ ■ ,5", 
we assume in this paper that it is possible to have = for some s = 1,2, ■ ■ ■ , S. Let's define 
Q^nzmin as the minimum a among the nonzero a's in the a-vector. Using the inequality J2s=i c^s-^s ^ 
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lin EL i.afe^o -^fc' ® '^^'^ expressed as 



P(c^c) < E 



(12) 



where = d'^i^anzmin/i^^No), K is the number of nonzero a's, is an index to indicate the k^^ 
nonzero a, and /i^ = A^. To solve (fT2)) . we need the marginal pdf / ■ ■ ■ by calculating 



o<3 ri^n rf^e(i)-2 rl^e(i) /"/'iv-i 



Jo 



Jo 



p(/il,-- - ,/iAr) 

X (i^jv ■ ■ ■ ■ ■ ■ ^/i2^/ii- (13) 



The joint pdf p (/ii, ■ ■ ■ , /iat) in (fT3l) is available in the literature lfT3l . [fT4ll as 



,/i7v) =p(/ii,--- ,/iiv)e 



E Mi 



(14) 



where the polynomial p (/ii, • ■ ■ ,/iAr) is 



AT 



2=1 jr>i 

Because we are interested in the exponent of W, the constant, which appears in the literature, is ignored 
in (fT5l) for brevity. 

Let's introduce / (/i£{i), ■ ■ ■ , fii^K)) which is defined as 



JO 



/3(/ii,-- - ,/iAr) 

X <^/iiv ■ ■ ■ ■ ■ ■ d^^d^^ (16) 



where /3 (/ii, ■ ■ ■ ,/iAr) is defined as 



p(/il,-- - ,pAr) 



p(pi,-- - ,pAr)e V y if ai = 

if 

p(pi,- ■■ ,pAf)e if ai > 0. 



(17) 



Then, we can see that p(pi,-- - ,pAr) < pIa^I;"'' jPa/) for either case of ai = or ai > be- 
cause e"^' < 1 for any i, and therefore / (p£(i), ■ ■ ■ , fie{K)) < / (p^(i), ■ ■ ■ ,P£(x))- The expressions for 
p(pi, ■ ■ ■ ,pAf) in (flVl) provide a convenience that is useful for the integration in (fT6l) by removing the 
exponential factors irrelevant to the variables of integration. 



For any case of p (/xi, ■ ■ ■ , /iat) in (fTTI) . / ■ ■ ■ , can be decomposed into two polynomials 

as 

K 

f ■ ■ ■ ,l^e{K)) = h ■ ■ ■ ,He(K)) X g ■ ■ ■ e '==1'''*'''. (18) 

The polynomial g ■ ■ ■ , jJi(K)) consists of factors irrelevant to the integration as 

K K 

g (p£(i) , ■ ■ ■ , = JJ /x*^^]^ JJ (p^(fc) - p^(j)) ^ (19) 

k=l j>k 

The other polynomial h (/i^(i), ■ ■ ■ , He{K)) for ai = is shown as 



00 

^0 ^0 ^0 JO 



and ■ ■ ■ , fJ'£{K)) for cti > is the same as in (|20|) except for the integrations over pj for 1 < i < 

— 1 as well as e^^^ removed. For ai = 0, e^'^^ and pi disappear after the integration, while /ii is 
present both in g ■ • ■ , fJ-eiK)) and (/^^(i), ■ ■ ■ , for ai > 0. The introduction in (fTTI) of e"''^ 

for ai = is needed to prevent (|20|) from diverging. 
Let's denote 

Then, r ■ ■ ■ , jJ^i{K)) is a polynomial with the smallest degree {M — Q + 1){N — Q + 1) — K where Q 

is an index to indicate the first nonzero a, that is, Q = £{1). The proof of this smallest degree is provided 
in the Appendix. Since / ■ ■ ■ , fii{K)) < / ■ ■ ■ , fJ'e{K)), the right side in (fT2l) is upper bounded 

as 



E exv\-Wy Uffk)] < I ■■■/ r ■ ■ ■ e c^m^k ) ' ' ' ^/^^(d • (22) 



exp [ -W^Hii^k) 



JO 



fc=i 

Note that 1 + W ^ W for high SNR. In addition, it can be easily verified that the following equality of 
a specific term in a polynomial for z/i > z/2 > • ■ ■ > i^i^ holds true; 

i/f ■ ■ ■ z/^^e fc-i d,^--- d,, = CW V ^^=1 ) (23) 
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where ^ is a constant. Since the polynomial r ■ ■ ■ , fie{K)) is a sum of a number of terms with 

different degrees, the result of (l22l) is a sum of the terms of W whose exponent is the corresponding 
degree. Furthermore, we are interested in the exponent of W to figure out the behavior of the diversity, 
not the exact PEP. Therefore, we can conclude that PEP is dominated by the term with the smallest degree 
of r ■ ■ ■ , He{K)) which is (M — Q + 1){N — Q + 1) — K , resulting in 



P(c ^ c) < r]W 



^{M-Q+l){N-Q+l) 

2 ^ \ -{M-Q+1){N-Q+1) 



= T] (^ "^rmn^" SNRj (24) 

where r/ is a constant. Fig. [3] shows the calculations of ^ corresponding to several specific a-vectors 
through Monte-Carlo simulation. Three dotted straight lines are PEP asymptotes at high SNR whose 
exponents correspond to 1,4, and 9. Regarding the exponent of PEP, we can see that the calculation of 
^ through simulation matches the analysis. 

For a rate kc/uc binary convolutional code and a fixed Gray-encoded constellation labeling map in a 
BICMB system, BER Pb can be bounded as 

oo Wi{d) 

P^^T Yl 9{d,Q{d,t),x) (25) 

d=dfree * = 1 

where g(-) is PEP corresponding to each error event, Wi{d) denotes the total input weight of error events 
at Hamming distance d, and Q is different for each error event. Since BER is dominated by PEP with 
the worst exponent term, the diversity order of a given BICMB system can be represented by 

O diversity = — Qmax + 1){N — Qmax + 1) (26) 

where Qmax is the maximum Q among the whole set of Q's corresponding to all of the error events. 

V. Relationship between Qmax and code rate 

The relationship between Qmax and the code with rate Rc is analyzed by using the same approach as 
in IfTSl which employes the Singleton bound to calculate the minimum distance of a non-binary block 
code. Let's define dE,sic, c) as the Euclidean distance between the mapped symbols of the two codewords 
residing on the s*'^ subchannel, dE,s{c-,c) = J2k=i IVk^s — yk,s\'^ where yk,s and yk,s are symbols on the 
s*'* subchannel at the time index k from the codeword c and c, respectively. If is equal to zero, then 
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all of the coded bits on the s*^ subchannel of the two codewords are the same. Since we assume that 
the consecutive bits are mapped over different symbols, the symbols corresponding to the same coded 
bits of the s*^ subchannel are also the same, resulting in dE,sic,c) = 0. Then, the parameter Q can be 
viewed as an index to the first non-zero element in a vector [rf^; i(c, c) c/^; 2(c, c) ■ ■ ■ (i^; .5(0, c)]. In 
the case of a pair of the codewords that has S —1 non-zero dE,s{c, c)'s, Q can be 2 because of the vector 
type [Ox X ■ ■ ■ x], or 1 from [x x x ■ ■ ■ x], [x x x x ■ ■ ■ x], ■ ■ ■ , [x x x ■ • ■ x 0], where x 
stands for non-zero value. In general, for a pair of the codewords that has 5h non-zero (i^; ,,(c, c)'s, Q is 
bounded as 

Q<S-Sh + 1. (27) 

If we consider the L symbols transmitted on each subchannel as a super-symbol over x^, then the 
transmitted symbols for all the subchannels in a block can be viewed as a vector of length S super- 
symbols. For convenience, we will call this vector of super-symbols as a symbol-wise codeword. We will 
now introduce a distance between c and c, which we will call Sh, as the number of non-zero elements 
in the vector [c?^; i(c, c) c/^; 2(0, c) ■ ■ ■ dE,s{Ci c)]. This distance is similar to the Hamming distance 
but it is between two non-binary symbol-wise codewords. By using the Singleton bound which is also 
applicable to non-binary codes, we can calculate the minimum distance of the symbol-wise codewords in 
a way similar to finding the minimum Hamming distance of binary codes. Let's define A4. as the number 
of distinct symbol-wise codewords. Then we can see that M. = 2™^"^^= from Fig.[T] Let k (0 < k < S — 1) 
denote the integer value satisfying 2™^('^^^) < Ai < 2™^'^. Since Ai > 2"^^('''~^), there necessarily exist 
two symbol- wise codewords whose k — 1 elements are the same. From the Singleton bound [9], the 
minimum distance of these symbol- wise codewords 5H,min is expressed as 5H,min < 5 — A; + L Since 

2mLSRc ^ 2"^^^^^^H,min + 'i-) gg(- 

^H,rmn < S - \ S ■ R,] + 1 (28) 

using the fact that 5H,min is an integer value. 

For a given BICMB system with 5H,min, it is true that the distance 5h between any pair of the codewords 
is always larger than or equal to the minimum distance 5H,min- By combining the inequalities of 5h > 
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,mm 



and (1271) . we get 5H,min < < S — Q + 1, leading to the following inequality as 



Q ^ S — SH,m.in + 1- 



(29) 



From the inequality (|29l) . the maximum Q among the whole set of Q's can be found as 



Q 



max 



S-6 



+ 1. 



(30) 



The inequality (|28l) and the equation (|30l) result in the following inequality as 




(31) 



The relationship (|3TI) can be supported by the examples in Section Hill where the 1/2-rate convolutional 
code is used in S* = 3 BICMB system with different spatial de-multiplexers. Since Qmax can be 2 or 3 
according to (|3T| ). the rotating spatial de-multiplexer used to calculate T2 in (flOl ) makes Qmax equal to 2 
while that of T3 in (fTTj) makes Qmax equal to 3. By considering the calculated diversity order of (|26l) . the 
maximum diversity order for a given code rate BICMB system is achieved by choosing the convolutional 
encoder and the spatial de-multiplexer satisfying Qmax = \S ■ -Rd • In this case, the maximum achievable 
diversity order is 



Based on (|32l ). Fig. |4] depicts the relationship between the code rate Rc, the number of streams 5*, and 
the maximum achievable diversity order. The whole combinations of S and Rc are divided into the four 
regions each representing the maximum achievable diversity order. For example, such combinations as 
{S, Rc) = (2, 1/2), (2, 1/3), (2, 1/4), (3, 1/3), (3, 1/4) in the region with the legend of MN achieve 
the full diversity order of MN. 

Since we assumed in the previous description that there exist the convolutional encoder and the spatial 
de-multiplexer which satisfy the relation Qmax = \S ■ Rc] , we will show the specific design method of 
the interleaver from a given convolutional encoder to ensure the relation. The following method is not the 
unique solution to guarantee the maximum achievable condition, but simple to state the concept. Let's 
consider a BICMB system with S subchannels and the code rate Rc = kc/uc convolutional code. Each 
of P = LCM(nc, S) coded bits is distributed to the S streams in the order specified by the interleaving 



Od^vers^ty = {M - \ S ■ Rc] +1){N-\S- Rc] + 1). 



(32) 
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pattern. Since each stream needs to be evenly employed for a period, P/ S coded bits are assigned on each 
stream. To guarantee Qmax = \S ■ -Rd ^ it is sufficient to consider only the first branches that split from 
the zero state in one period because of the repetition property of the convolutional code. We incorporate 
the basic idea that once the s*^ stream is assigned to an error bit of the first branch, obviously, all of 
the error events containing that branch give as > 0, resulting in Q < s. By extending this idea, we can 
summarize the assignment procedure as 

1) the lowest available subchannel is assigned to the error bit position of one of the first branches 
which have not yet assigned to any subchannel, 

2) the procedure 1) is repeated until all of the first branches are assigned to one of the subchannels. If 
all of the first branches are assigned to one of the subchannels, the assignment procedure quits after 
the rest of subchannels are assigned randomly to the unassigned bit positions, subject to satisfying 
the rate condition on each subchannel. 

We will explain the procedure above by using the example of Fig. [2] where P = 4 and the number 
of available assignment for each subchannel is P/S = 1. From the trellis, we can see there are 2 first 
branches that split from the zero state for one period. According to the procedure above, we need to assign 
the best subchannel to one of the first branches. In this example, let's assign it to the dummy variable a. 
This ensures ai > for all the error events stemming from this branch, resulting in Q = 1. For the second 
branch connecting Xj to Xu, we need to assign the next available lowest subchannel, which is 2, to the 
dummy variable c. As a result, we can see that Q < 2 for the error events that share this branch. Since 
all of the first two branches are assigned, the unassigned dummy variables are allocated randomly with 3 
and 4. This procedure assures that Qmax is equal to 1 or 2 for this BICMB system. On the other hand, 
Qmax > 2 from the equation (|3T| ) resulting from the Singleton bound. Therefore, this method guarantees 
Qmax = 2, which is the condition to achieve the maximum diversity order for the given convolutional 
code. 

VI. Simulation Results 

To show the validity of the diversity order analysis in Section |IV] using the parameter Qmax calculated 
by the method in Section |llll BER against SNR are derived through a Monte-Carlo simulation. Fig. |5] 
shows BER performances for the cases corresponding to Ti, T2, T3 in (|9l), (flOl ). and (fTTI) . The well- 
known reference curves achieving the full diversity order of MN are drawn from the Alamouti code for 
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the 2x2 case and 1/2-rate orthogonal space-time block code (OSTBC) for the 3x3 case |fT6l . From 
®, Qrnax fof Ti Is found to bc 1 bccausc > 1 for s = 1, 2 in all of the a-vectors. In this case, as 
predicted by the analysis in [[5]|, jH, the diversity order equals 4 by calculating (|26|) with M = N = 2. 
From the figure, we can see that BER curve for Ti is parallel to that of 2 x 2 Alamouti code. Since Qmax 
for T2 is 2 due to the vector [0 5 3], the calculated diversity order is 4 in the case of M = = S* = 3. 
This can be verified by Fig. [51 losing the full diversity order 9. Although the same number of subchannels 
and the same convolutional code as for T2 are used, the different spatial de-multiplexer from that of T2, 
described in Section [ni] for T3, gives no diversity gain at all. The reason for this is that the vector [0 5] 
which can be observed from the transfer function in T3 makes Qmax 3 resulting in the calculated diversity 
order of 1 in the equation (|26l) with M = A^ = S* = 3. This matches the simulation result. 

Table IJ shows results of a computer search of the a-vectors of BICMB with industry standard 64-state 
convolutional codes and a simple rotating spatial de-multiplexer. The generator polynomials for rates 1/2 
and 1/3 are (133, 171) and (133, 145, 175) in octal, respectively. For the high rate codes such as 2/3 and 
3/4, the perforation matrices in (T2\ are used from the 1/2-rate original code. Instead of displaying the 
whole transfer functions, we present only three a-vectors among such a number of dominant a-vectors 
that lead to Qmax- The search results comply with the bound Qmax > \S ■ Rc\ in (|3T1) as was analyzed 
in Section |Vl 

Fig. |6] shows the BER performance of the 2 x 2 S* = 2 BICMB system with the 64-state convolutional 
code and a simple rotating spatial de-multiplexer. The diversity orders of the systems with punctured codes 
are 1 because both Qmax values corresponding to the codes shown in Table H] are 2, while the system with 
the 1/2-rate convolutional code, whose Qmax is equal to 1, achieves full diversity order of 4. 

As shown in Fig. |71 for a 3 x 3 system with 3 streams, only 1/3-rate convolutional code achieves full 
diversity order of 9 since other codes have Qmax of larger than 1 as given in Table HI The analytically 
calculated diversity orders by using (|26l ) and Table H] are 4, 4, 1 for 1/2, 2/3, 3/4 respectively, which can 
be easily verified from Fig. |7] by being compared with the asymptotes. For the rate-3/4 code with the 
same spatial de-multiplexer, reducing one stream improves the performance dramatically. The diversity 
order of this case is 4 from the equation (|26l ) with M = A^ = 3 and Qmax = 2. 

VII. Conclusion 

In this paper, we investigated the diversity order of BICMB when the interleaver does not meet the 
previously introduced design criteria. We introduced a method to calculate the a-vectors from a given 
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convolutional code and a spatial de-multiplexer by using a transfer function. By using this method, the 
a-vectors that do not fulfill the full diversity order criteria are quantified. Then, the diversity behavior 
corresponding to the a-vectors was analyzed through PEP calculation. The exponent of PEP between two 
codewords is (M — Q + 1)(N — Q + 1) where Q is the first index to the non-zero element in the a-vector. 
Since BER is dominated by PEP with the smallest exponent, the diversity order is (M — Qmax + l)(iV — 
Qmax + 1), where Qmax Is the maximum among Q's corresponding to each a-vector. We provided the 
simulation results that verify the analysis. We also showed that Qmax is lower bounded by the product of 
the code rate and the number of streams. This result indicated that we need Rc < l/S to achieve the full 
diversity order of NM. 

Appendix 
Proof of the smallest degree 

Since r (/i^(i), ■ ■ ■ , I^e(k)) is a product of the two polynomials as shown in (|2T]) . the smallest degree of 
r (/i£(i), ■ ■ ■ , fie^K)) is a sum of the smallest degrees of each polynomial. Let's denote Dg^smaiiest as the 
smallest degree of the polynomial g ■ ■ ■ , IJ'e{K)) ■ It is easily found that all of the terms in (fT9l ) have 

the same degree. Therefore, 

Dg,,mallest = K{M - N) + K{K - 1) (33) 

where the degree of K{M — N) is contributed by the K factors of the form ^Y{k)^ ^ K[K — 1) comes 
from the (^) factors in the form of (yU^(A;) — fi^j)Y . 

To calculate the smallest degree of the polynomial h ■ ■ ■ , fJ^e{K)), we first focus on the case of 

ai = 0. The polynomial ■ ■ ■ ,/iAr) in (fT5l) has factors of the form fj^f^^^ and (^) factors of 

the form (/Xj — /Uj)^. The division by g ■ ■ ■ makes the common factors eliminated, leaving 

N — K and (^) — (^) factors of the form and — /ij)^, respectively. Hence, the resulting 

polynomial p (/xi, ■ ■ ■ , /"Jv) /g ■ • • , /"^(a-)) has degree 

Dh,org = {N- K){M -N) + N{N - 1) - K{K - 1). (34) 

The integration over fii for 1 < i < £(1) — 1 in (|20l ) makes the variables Hi, I < i < — 1 vanish. 
Since all the terms in p (/ii, ■ ■ ■ , hn) / g (/i^(i), ■ ■ ■ , Hi^k)) have different distributions on the degrees of 
the individual variables although they have the same degree as an entire term, the smallest degree of 
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h ■ ■ ■ , fii(K)) is determined by the term which has the largest degree of the vanishing variables of 

p {fii, ■ ■ ■ , fi^) / 9 (/^£{i), ■ ■ ■ , f^e{K)) ■ It's not necessary to find all the terms with the largest degree of the 
vanishing variables. Instead, we can see that one of those terms, whose degree is Dh,org, includes the 
following factors 

^(l)-l N 

n f^r-'^u^- (35) 

1=1 j>i 



In this case, the degree for the vanishing variables in (|35l) is 

Dh,vanishing = (^(1) - 1)(M - iV) + 2N{£{1) - 1) - " 1) (36) 

where (^(1) — — N) is contributed by the £(1) — 1 factors of the form , and the rest of the 

degrees are calculated from the factors of the form /if. 

Finally, the integration over jii for + 1 < i < N accumulates the degree of the current variables 
and adds up to the degree of the corresponding fik,k E T, where an ordered set T is defined as 
{i : ai > for 1 < i < S}. In addition, during the each integration, the degree increases by 1 due to 
the fact that J^^' = ^^'^^/{n + 1). Since — 1 variables from original N — K variables 
of integration vanished in h ^hk^k)), the degree to be added by the remaining variables of 

integration is 

DhMded = N-K- £(1) + 1. (37) 
The smallest degree of r (/^^(i), ■ ■ ■ , He(K)) is now ready to be calculated, which is 

smallest -Og,sma//est ~l~ D ii g^nallest 

D g ^smallest ~l~ i^-Dfi Q^g D fi yg^^igfii^g -f- Dji f^dd^fij 

= (M -£(!) + 1)(A^ -£(!) + !)- (38) 

where Dh,org — Dh,vanishing stands for the degree of the remaining non-vanishing variables of the term 
that leads to the smallest degree of h ■ ■ ■ , iJe(K))- 

In the case of «i > 0, the integrations over the variables /ij, 1 < i < £(1) — 1 in (|20l ) do not exist. 
Contrary to the case of «i = 0, no variable vanishes, resulting in Dh,vanishing = 0, and Dh,added = N — K. 
Equation (|38]) holds true for ai > since £(1) = 1 in this case. Therefore, for any case of ai, the smallest 
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degree of the polynomial r ■ ■ ■ , fJ^e{K)) is (M - £(1) + 1)(A^ - £(1) + 1) - K. 
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Fig. 1. Equivalent system model of BICMB 
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Fig. 2. Trellis of 4-state 1/2-rate convolutional code with 4 streams 
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Fig. 3. Monte-Carlo Simulation Results for 3 x 3 5 = 3 case with 4-QAM 
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Fig. 5. Simulation results for a 4-state 1/2-rate convolutional code with different spatial de-multiplexers. 4-QAM is used for all of the 
curves. 



TABLE I 

Search Results of the dominant a-VECTORS for 64-state convolutional codes 
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Fig. 6. Simulation results for the 2 x 2 case where 16-QAM is used for all of the curves 
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Fig. 7. Simulation results for 3 x 3 case where 16-QAM is used for all of the curves 



